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Abstract. The cutoff phenomenon describes a sharp transition in the 
convergence of a Markov chain to equilibrium. In recent work, the au- 
thors established cutoff and its location for the stochastic Ising model 
on the d-dimensional torus (Z/nZ) d for any d > 1. The proof used the 
symmetric structure of the torus and monotonicity in an essential way. 

Here we enhance the framework and extend it to general geometries, 
boundary conditions and external fields to derive a cutoff criterion that 
involves the growth rate of balls and the log-Sobolev constant of the 
Glauber dynamics. In particular, we show there is cutoff for stochastic 
Ising on any sequence of bounded-degree graphs with sub-exponential 
growth under arbitrary external fields provided the inverse log-Sobolev 
constant is bounded. For lattices with homogenous boundary, such as 
all-plus, we identify the cutoff location explicitly in terms of spectral 
gaps of infinite- volume dynamics on half-plane intersections. Analogous 
results establishing cutoff are obtained for non-monotone spin-systems 
at high temperatures, including the gas hard-core model, the Potts 
model, the anti-ferromagnetic Potts model and the coloring model. 



1. Introduction 

The total- variation cutoff phenomenon describes a sharp transition in the 
L 1 -distance of a finite Markov chain from equilibrium, dropping abruptly 



from near its maximum to near (see formal definitions in £2.1). Since its 
discovery in the early 80's by Aldous and Diaconis ([4] following p|[l3]) the 
cutoff phenomenon has been believed to be widespread in Markov chains. 
Until very recently, however, rigorous proofs of cutoff were confined to very 
few cases where the stationary measure was well-understood and fairly sim- 
ple, e.g. uniform, 1-dimensional unimodal etc. The focus of this work is on 
cutoff for Glauber dynamics for spin-systems such as the Ising model, chains 
which are natural models for the evolution of these systems in addition to 
being their most practiced sampling methods. Despite extensive study, the 
understanding of the stationary measure in these models remains limited. 
For instance, for the Ising model on Z 3 the value of the critical temperature 
is unknown, as is the basic question of whether spin-spin correlations at 
criticality decay with distance. 
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Peres conjectured in 2004 that Glauber dynamics for the Ising model on 
any sequence of bounded-degree transitive graphs would exhibit cutoff at 
high temperatures, yet even for the Id torus (Ising on a cycle) this was 
open until recently. In the companion paper |30| the authors proved cutoff 
and established its location for Glauber dynamics for the Ising model on the 
(i-dimensional torus {'L/nL) d for any d > 1 at any temperature where the 
stationary measure satisfies the decay-of-correlation condition strong spatial 
mixing (e.g., on Z this condition always holds and on Z 2 it holds all the 
way up to the critical temperature; see ^2] for definitions and background.) 
The proof featured a framework for eliminating the dependencies between 
distant small boxes, with which the L 1 -mixing of the whole system could 
be reduced to L 2 -mixing of its projections onto these boxes. The symmetry 
of the torus, giving any such box the same effect on mixing, played a key 
role in the analysis (e.g., the basic setting of a box in Z rf with free boundary 
conditions was not covered), as did the monotonicity of the Ising model. 

In this work we enhance the above framework to forsake the symmetry 
and monotonicity limitations. Postponing standard definitions to we 
next describe the new results, first for the Ising model and thereafter for 
non-monotone systems such as the gas hard-core model and the Potts model. 

1.1. Results. Theorem Q] below formulates a cutoff criterion for stochas- 
tic Ising on general graphs with arbitrary boundary conditions, interaction 
strengths and external fields (possibly non- uniform) . The criterion factors in 
the growth rate of balls of logarithmic radius and the log-Sobolev constants 
of the induced system on subgraphs of poly-logarithmic diameter. As one ap- 
plication we obtain that on bounded-degree graphs where these log-Sobolev 
constants are bounded away from (e.g., lattices in the strong spatial mixing 
regime) there is cutoff as long as the graphs have sub- exponential growth. 

Theorem 1. Let G be a connected graph on n vertices, let A > 2 be its 
maximal degree, and consider Glauber dynamics for the Ising model on G 
with arbitrary (possibly non-uniform) interactions and external fields. Let 

a s = mm {a s (H) : H CG with diam(#) < log 2 n} , 

p= max \B V (1 10A27 1 lognl) I , 

vev(G) 1 v n 

where a s (H) is the log-Sobolev constant of the dynamics on H with free 
boundary and the original interactions and external field, and B v (r) is the 
subgraph induced on the ball of radius r around the vertex v. Then 

W(e) - W(l - e) < 16Aa^" 2 logp 

for any e > and large enough n. In particular, on any sequence of graphs 
G n where AS.T 1 = O(l) and p = the Glauber dynamics exhibits cutoff. 
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The above theorem allowed for an arbitrary (possibly non-uniform) exter- 
nal field, which in particular encompasses any arbitrary boundary condition. 
As mentioned above, it is known (due to [36] ) that the log-Sobolev constant 
of stochastic Ising on a finite box in Z rf is bounded away from whenever 
there is strong spatial mixing. The growth rate parameter p defined in 
Theorem [T] there satisfies p x logn, whence we deduce the following: 

Corollary 2. Let d > 1 and consider Glauber dynamics for the Ising model 

on a box A C Z rf of side-length n under arbitrary boundary conditions, local 
interactions and external fields. Then throughout the regime of strong spatial 
mixing the dynamics exhibits cutoff with a window o/0(loglogn). 

For instance, the above corollary implies that on Z 2 with free boundary 
conditions the dynamics exhibits cutoff for any /3 < /3 C = | log(l + \/2). 

The proofs of the aforementioned results are not specific to the Ising model 
but rather can be applied to Glauber dynamics for any spin system model 
at high enough temperature, notably including non-monotone systems such 
as the Potts model. This is formalized in the generic Theorem 5.1 (see ^5|, 
from which we can derive the following corollaries: 

Theorem 3. Let d > 1 and consider the Potts model on a box A C Z d 

of side-length n with q > 2 colors, inverse-temperature (3 > satisfying 
(3 < g log ( max {l + ^fzi , (1 + ^) 1 ^ 2d }) an d arbitrary boundary conditions. 
Then the Glauber dynamics exhibits cutoff with a window of 0(log log n). 

Furthermore, the analogous statement also holds for Glauber dynamics 
for the anti-ferromagnetic Potts model as long as > (5 > — \ log(l + 2 dq-i )- 

Theorem 4. Let d > 1 and consider the gas hard-core model on a box AcZ d 
of side-length n with fug acity A < 55=2 (1+ 2it^l) 2d ~ l an ^ arbitrary boundary 
conditions. The heat-bath dynamics has cutoff with window O(loglogn). 

Theorem 5. Let d > 1 and consider the proper coloring model on a box 

Ac ^ of side-length n with q > Ad(d+1) colors and arbitrary boundary con- 
ditions. Then the heat-bath dynamics has cutoff with window O(loglogn). 

Remark. While the above results were stated for Z d , analogous results hold 
for any d-dimensional lattice L d with finite periodicity, such as triangular, 
hexagonal etc., in particular including non-bipartite lattices (in which one 
cannot use the standard monotone reordering procedure of an anti-monotone 
system). Other notable examples for the lattice L rf include: 

(i) a d-dimensional lattice (e.g. Z rf ) with long range interactions (edges 
between any two vertices with distance at most I for some I > 1 fixed) . 

(ii) a product of a d- dimensional lattice with any fixed graph H. 
Similarly, Theorems |3j^5] were stated for the heat-bath dynamics yet the 
proofs hold also for Metropolis-Hastings with suitably modified constants. 
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In all the results listed thus far there was no closed form for the cutoff 
location. The proofs specify this location as a threshold for the cumulative 
L 2 -mixing on poly-logarithmic balls around each vertex, in which the effect 
of different subgraphs can greatly vary in general. When the boundary 
conditions are homogenous (e.g., free or all-plus) we can obtain an explicit 
formula for the cutoff location in terms of spectral gaps of the infinite- volume 
dynamics on half-plane intersections. This is stated next for 2d lattices with 
all-plus boundary. (An analogous statement holds for free/minus boundary.) 

Theorem 6. Consider Glauber dynamics for the Ising model on A C Z 2 ; an 
nxn box with all-plus boundary conditions. Throughout the high temperature 
regime (3 < (3 C = |log(l + V2) cutoff occurs at (A^ A 2\j{)~ 1 \ogn, where 
Aqo and An are the spectral gaps of the dynamics on the infinite-volume 
lattice and the half-plane with all-plus boundary condition, respectively. 



In §[6] we prove the generalization of the above theorem (Theorem |6.1[) 



to Z d for any d. In that generality, the cutoff location is given in terms 
of d separate infinite-volume spectral gaps in Z rf where the corresponding 
boundary is imposed on the half-planes in a subset of the d coordinates. 

Furthermore, as in the remark following Theorems [3]-[5j the arguments 
extend beyond Z d to give the cutoff location for any d-dimensional lattice L d 
with finite periodicity and homogenous boundary conditions. For instance, 
a special case of example (JTTJ) from that remark is the circular ladder graph, 



the product of the cycle Z/nZ with an edge. In 27, Theorem 15.10] it was 
shown that Glauber dynamics for the Ising model on this graph mixes in 
continuous-time 0(n) and noted that the correct order would be O(logn) via 
a block-dynamics argument. Our results imply that in fact there is cutoff in 
this setting and the mixing asymptotics are (2Aoo) _1 logn where Aoo is the 
(explicitly known) spectral gap of Glauber dynamics on the infinite ladder. 

Contrary to the results establishing cutoff, the argument that relates its 
location to infinite- volume spectral-gaps does rely on the monotonicity of the 
Ising model in an essential way. Nevertheless, in the special case of bipartite 
lattices such as Z rf the standard monotone reordering of the partial order on 
lattice configurations extends the arguments also to anti- monotone systems. 
The analogue of Theorem [6] (cutoff and its location in terms of spectral- gaps 
of infinite- volume dynamics on half-plane intersections) thus holds also for 

(i) anti-ferromagnetic Ising model on Z d under all-plus/all- minus/free b.c. 

(ii) gas hard-core model on Z d under all-plus/all-minus/free b.c. 
and all throughout the strong spatial mixing regime. 



1.2. Methods. At high temperatures the fast decay of spatial dependence 
implies that the measure rapidly becomes well mixed locally. Moreover, 
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Figure 1 . Potts model with q = 3 colors on a 64 x 100 hexagonal lattice 
at high temperature (/? = 0.1). On right, update support for the barrier 
dynamics in yellow, intensity of red depicts the age outside the support. 
On left, the coupled configuration wherever uniquely determined. 

propagation of information occurs at a constant rate and so the configuration 
in distant parts of the graph will be close to independent. This suggests that 
a product Markov chain is the right heuristic to understand the convergence 
of high temperature Glauber dynamics. We next discuss showing cutoff for 
such chains, followed by our approach to reduce the problem to that setting. 

1.2.1. An L 1 to L 2 reduction for product chains. An important step in our 
proof is bounding the total variation distance from stationarity of product 
chains by the L 2 -distance of its component chains. This is formalized by the 
next proposition (proved in fj3]) which we believe is of independent interest. 

Proposition 7. Let Xt = (X/,...,X™) be a product chain, i.e. the X\ 's 
are mutually independent ergodic chains with stationary measures m, . . . , ir n 
respectively. Let ir denote the product measure of tt\, . . . ,ir n and define 



7T; 



i2 

li a (w 4 ) 



(1.1) 



Then \\P(Xt G •) — vr|| TV < yWit for any t > and furthermore, for every 
5 > there exists some e > so that the following holds: If for some t > 

max||P(AlG-)-^|| L oo M <£ (1.2) 

then 



7T 



(2$(vW2) - 1) 



(1.3) 



where $ denotes the cumulative distribution function of the standard normal. 
In particular, if for a family of chains DJtt — > then \\P(Xf G •) — vr|| TV — > 
whereas if Tit — > oo and (1.2) holds for e — > then ||P(-Xt G •) 
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Figure 2. Gas hard-core model on a 64 x 128 triangular lattice at low 
fugacity (A = 0.6). The update support is highlighted on the right vs. 
the coupled configuration wherever uniquely determined on the left. 

This identifies the cutoff location for product chains as above as the time 
t such that 9Jlt ~ L and further establishes the total- variation distance from 
equilibrium within the cutoff window to be 2$(v / 9K</2) — 1 = Erf (y/Ttt/8) 
asymptotically, where Erf(x) = 4= e~ l dt is the error- function. 

Example. Let (Xt) be continuous-time random walk on the hypercube Z?j . 
That is, its discrete-time analogue (Kt) flips a uniform coordinate at each 
step, while Xt = Kjsr t for a Poisson random variable Nt ~ Po(t) (eliminating 
periodicity issues). This was one of the original examples of cutoff, which 
occurs here at time \n\ogn as shown by Aldous |2j. Moreover, cutoff occurs 
within a window of 0(n), as follows from explicit bounds of |14| on the 
distance of the lazy discrete-time chain (applies K with probability an d 
is idle otherwise) from equilibrium at time t = ^nlogn + cn. These bounds 
were refined in |8, Theorem 1] to show this distance is Erf (e _2c /\/8) + o(l). 

To reobtain this result for the continuous-time chain via Proposition [7] 
argue as follows. Let (Xt) be the lazy continuous-time chain corresponding 
to the discrete kernel K = (I + K)/2. It is well known (and easy to see) 
that the heat-kernels corresponding to (Xt), (X t ) satisfy H t = H2t, thus it 
suffices to estimate \F(X t G •) — vr|| TV . Yet (Xt) is the product chain of i.i.d. 
chains (XI) with stationary measures that are uniform on {0, 1}, each of 
which is flipped to a uniform state at rate 1/n. One can easily verify that in 
this case ||P(X| G •) — ^i\\ L 2 = e _2t//n . As such, the proposition implies 
that (Xt) has cutoff at ^nlogn. Furthermore, if t = \n\ogn + cn for some 
eel then \¥(X t G ■) -7r|| TV = Erf(e~ c /v / 8) + o(l). Translating this to (X t ) 
recovers the mentioned result that if t = |nlogn + cn for some c£i then 

\F(X t G ■) — vr|| TV = Erf (e~ 2c /V8) + o(l) . 
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Remark. Proposition [7] in fact applies to any two product-measures (without 
requiring that one arises from a Markov process and the other is its station- 
ary measure) and provides an L 1 to 1? reduction of the distance between 
them. One virtue of formulating it in terms of Markov chains (beyond its 
application in this work) is that one can then derive the L°° condition on 
X\ from the analogous I? requirement on Z|, 2 (see, e.g., Corollary 



3.1). 



1.2.2. Breaking dependencies. While the product chain paradigm is useful 
in explaining the presence of cutoff, the key ideas of the proof are methods 
to break the dependencies inherent in the dynamics. Proposition [7] does 
suggest a natural way to establish lower bounds on the mixing time. If one 
takes well separated blocks B{ C V, the projections of the chain onto the 
blocks are essentially independent. Taking X\ = Xt(Bi) in Proposition [7] 
gives lower bounds on the distance from stationarity which are sufficiently 
strong to give sharp enough lower bounds on the mixing time. 

The dependencies of the measure and the chain, however, make sharp 
upper bounds far more challenging. Since projection decreases the total 
variation distance this approach does not yield upper bounds. Despite the 
rapid decay of correlations with distance, it remained a challenging open 
problem to give sharp total variation distance bounds even for Id Ising. 
To overcome this in the case of the Ising model on the torus in 30 we 
introduced a method to bound the total variation distance at time t + s by 
the expected total variation distance at time s of the chain projected onto 
a random set we call the update support. 

Viewing the Markov chain from time t to time t + s as a random function 
fw '■ {+1> — ~^ {+1, — 1}^ where f\y(X t ) = X t+S it may be the case 
that the value of fw is determined by only a subset of the spins of V - 
the smallest such set is the update support. Since by definition the spins 
outside of this set have no effect on the value of the chain at time t + s, the 
total variation distance at time t + s is bounded by the expected projection 



onto the update support at time t. (Lemma 3.3 places this in a more general 
context of random mapping representations for Markov chains.) 

A key step in our analysis is then to analyze the structure of the update 
support. To ensure that well separated vertices are independent we modify 
the map fw to satisfy this property yet in such a way that it can be coupled 
to the true dynamics with high probability. We call this modified process 
the barrier dynamics, constructed so that information cannot travel further 
than a given distance. Using this independence we wish to show that when 
s is sufficiently large, yet still of smaller order than the mixing time, the 
update support is sparse. Namely, the support size decays exponentially 
and it splits into small well separated components w.h.p. (see Figures |l}[3T) . 
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Figure 3. Ising model on a 128 x 256 square lattice at f3 = 0.32. 
Frames depict evolution of the update support of the barrier dynamics 
(support sites in yellow) into microscopic disconnected components. 

While |30| used the symmetries of the torus in numerous ways, in this 
paper we consider graphs and systems with highly inhomogeneous structure 
and external fields. To obtain the sought geometrical properties of the up- 
date support for the Ising model, our sole requirements are sub-exponential 
local growth of neighborhoods and bounds on the log-Sobolev constant for 
the dynamics locally. Together these ensure fast local mixing, exploited 
via an appropriately-tailored variant of the barrier dynamics from |30|. We 
then analyze the structure of the update support by considering which spins 
couple under the monotone coupling through the barrier dynamics. 

The quantitative bounds of Proposition [7] then allow us to compare the 
derived upper and lower bounds. Again, arbitrary geometry and external 
fields may grant much stronger effect to certain parts of the system on mixing 
compared to others (whereas the setting of the torus studied in [30] had 
complete symmetry and the problem reduced to a product of i.i.d. chains). 
Nevertheless, it is possible to show that there exists a well separated subset of 
blocks which dominates an expected sparse set of blocks from the matching 
upper bound. Finding this set of blocks is done in a greedy manner. Overall, 
we identify a time t* at which the dynamics is far from mixed through the 
lower bound and then is well mixed at time t* + s through the upper bound. 

1.2.3. General spin-systems. In £j5] we extend our analysis to the case of 
general spin-systems. Our construction of the function fw described above 
implicitly used a grand-coupling of the chain starting from all possible initial 
configurations. In the case of the Ising model this is the standard monotone 
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coupling, through which the chains from all starting states coalesce rapidly 
(namely, exponentially fast) at high temperatures. For general models, e.g., 
non-monotone systems, we extend the framework to remain valid so long 



as such a grand coupling is available (see Theorem 5.1). For concreteness 
we give a condition, valid for high enough temperatures, under which one 
can construct a grand coupling as required, thus implying cutoff and the 
aforementioned Theorems [3}{5] This approach is akin to methods used in 
Coupling From The Past techniques for perfect simulation. 

1.2.4. Cutoff location in terms of infinite-volume spectral-gaps. In ^6] (see 



Theorem 6.1) we derive the cutoff location for the Ising model on a box in 
Z d under free/all-plus/all- minus boundary conditions. Unlike the setting of 
the torus considered in [30] , the presence of boundary conditions destroys the 
symmetry of the graph and creates a disparity between the mixing effects of 
vertices near the boundary vs. those near the center of the box. As such, the 
cutoff location becomes a function of the spectral gap of the dynamics not 
just on the infinite-volume lattice but also on intersections of half-spaces. 
Our approach is to classify vertices according to their position in the box 
in terms of the number of faces they are close to and to couple the local 
dynamics to the one in the appropriate half-space. We note in passing that 
at present it is unknown whether for instance the spectral-gaps featured 
in Theorem [6] satisfy An < Aoo- (A weak inequality holds, while equality 
would imply that the cutoff location specified there is only a function of Aoo.) 
There are plausible heuristics suggesting that these half-space spectral-gap 
terms are imperative in at least some settings, e.g., models with stronger 
correlations on the boundary than in the bulk due to boundary conditions 



(see 34 for more details). 



2. Preliminaries 

2.1. Mixing and the cutoff phenomenon. The L 1 (or total- variation) 
distance of a Markov chain from equilibrium is one of the most important 
notions of convergence in MCMC theory. For two probability measures v±, vi 
on a finite space f2 the total-variation distance is defined as 

\v\ - ^2||tv = max|z/i(A) - v 2 (A)\ = - ^ Wi(v) - i^(o-)| , 

o-efi 

i.e. half the L 1 -distance between the two measures. Let (Xt) be an ergodic 
finite Markov chain (Xt) with stationary distribution tt. The total-variation 
mixing-time of (Xt), denoted t MIX (e) for < e < 1, is defined to be 

W(e) = inf \ t : max ||P CT0 (X t G •) - 7r|| TV < e \ , 
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where here and in what follows ¥ ao denotes the probability given Xq = a®. A 
family of ergodic finite Markov chains (Xt), indexed by an implicit parameter 
n, is said to exhibit cutoff iff the following sharp transition in its convergence 
to stationarity occurs: 

lim W ^ . =1 for any < e < 1 . (2.1) 

n->oo t Mlx (l — e) 

That is, ^mix(q ; ) — (1 "H o(1))£mix(/3) for any fixed < ol < /? < 1. Addressing 
the asymptotic error in this formulation is the notion of the cutoff window: A 
sequence w n = o(tMix{c ^)) is a cutoff window if i MIX (£) = ^mix 

(l-e)+0(w n ) 

holds for any < e < 1 with an implicit constant that may depend on e. 
Equivalently, if t n and w n are two sequences such that w n = o(t n ), one may 
define that a sequence of chains exhibits cutoff at t n with window w n iff 

lim liminf max ||P CT (X tn - 7Wn G •) - vr|| TV = 1 , 
7— >co n— >co (jo en 

lim lim sup max \\P ao (X tn+ ^ Wn G •) - 7r|| TV = . 

7^00 n^oo cr en 

The cutoff phenomenon was first identified for random transpositions on 
the symmetric group in 13 , and for the riffle-shuffle and random walks on 
the hypercube in |2j. Its name was coined by Aldous and Diaconis in their 
famous paper E], where cutoff was shown for the top-in-at-random card 



shuffling process. See [6j[7 40 and the references therein for more on the 
cutoff phenomenon. Note that in the examples above, as well as in most 
others where cutoff has been rigorously shown, the stationary distribution 
has many symmetries or is essentially one-dimensional (e.g. uniform on the 
symmetric group |13|, uniform on the hypercube [2] and one-dimensional 
birth-and-death chains |16|). 

Establishing cutoff can prove to be challenging already for simple families 
of chains. For instance, even for random walks on random regular graphs 
(where the stationary distribution is uniform) cutoff was only recently veri- 
fied in [32]. Prior to that work there was no known example of a family of 
bounded-degree graphs where the random walk exhibits cutoff. 

2.2. Ising model. The Ising model on a finite graph G with vertex-set 
V and edge-set E is defined as follows. Its set of possible configurations 
is = {±1} , where each configuration corresponds to an assignment of 
plus/minus spins to the sites in V. The probability that the system is in a 
configuration a G VL is given by the Gibbs distribution 



^- P ' V uv&E «ev / 
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where the partition function Z(ft) is a normalizing constant. The parameters 
ft and h are the inverse-temperature and external field respectively; for ft > 
we say that the model is ferromagnetic, otherwise it is anti-ferromagnetic. 
These definitions extend to infinite locally finite graphs (see e.g. [28|[34] ) . 

In full generality the model associates arbitrary (possibly non-uniform) 
interaction strengths to the bonds {ft uv : uv G E} as well as arbitrary 
external fields to different sites {h u : u G V}. 

We denote the boundary of a set A C V as the neighboring sites of A 
in V \ A and call r G {±1} SA a boundary condition. A periodic boundary 
condition on (Z/nZ) d corresponds to a d-dimensional torus of side-length n. 
For Ac V, denote by <r(A) the spins that a assigns to A. Let denote the 
measure on configurations on A given the boundary condition red, that is, 
the conditional measure //(cr(A) G • | a(dA) = r((9A)). We will often refer 
to the projection of this measure onto a subset of the spins A C A (i.e. the 
marginal of on {il}" 4 ) which we will denote by /^XU- The notation fi^ 
will denote the measure on {±1} A under free boundary, i.e. the one obtained 
by setting to all the interactions between A and dA. 

2.3. Spin-system models. In ^5] we extend the cutoff criterion provided 
in Theorem [T] for Glauber dynamics for the Ising model to a general class of 
spin-systems with nearest-neighbor interactions, defined as follows. 

Let G = (V, E) be a finite graph and let 5? be a finite set (spins). Further 
consider functions g u , u : S^ 2 — > MU{— oo} for every bond (u, v) G E (nearest- 
neighbor interactions) and functions h u : 5^ — > M. for all u G V (external 
fields). The corresponding spin system is the probability distribution on the 
set of configuration J>^ v given by 



M(o-) = 7F ex P 



^2 9u,v{(y{u),a{v)) + ^h u {a(u)) 
(u,v)eE u£V 



where Z is a normalizing constant (the partition function). 

The following well-known models were featured in Theorems [3}|5| 

• The g-state Potts model with inverse-temperature (3: 

y = {!,..., q} , g UjV (x,y) = I ^ X = V . ,h = 0. 



otherwise 

(Notice the factor of 2 in the interactions g U)V whose sole purpose is to 



make (3 consistent with the Ising model definition (2.2) when q = 2.) 
The model is ferromagnetic if f3 > and anti-ferromagnetic if ft < 0. 
The proper g-colorings model: the special case of the anti- ferromagnetic 
Potts model when ft = —oo. 
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• The gas hard-core model with fugacity A > 0: 

J — oo x — y — 1 

y = {o,i}, 9 uA^v) = [ otherwise , K{x) = t{ x=1} \og\. 

2.4. Glauber dynamics for the Ising model. Glauber dynamics for 
the Ising model (also dubbed the Stochastic Ising model) is a family of 
continuous-time Markov chains on the state space f2, reversible with respect 
to the Gibbs distribution, given by the generator 

= a) (/(O "/(*)) (2.3) 

zgA 

where a x is the configuration a with the spin at x flipped. The transition 
rates c(x, a) can be chosen arbitrarily subject to certain natural conditions 
(e.g., detailed balance), yet our attention in this work will be focused on the 
two most notable examples of Glauber dynamics: 

(i) Metropolis: c(x, a) = exp ^2f3o{x) Y2 y ~ x a (y)J A 1. 

(ii) Heat-bath: c(x, a) = 1 + exp ^ — 2f3a(x) Y2 y ~x a (y)^) 

Each of these two flavors of Glauber dynamics has an intuitive and useful 
equivalent graphical interpretation: Assign i.i.d. rate-one Poisson clocks to 
the sites, and upon a clock ringing at some site x do as follows: 

(i) Metropolis: flip o~(x) if the new state, a x , has a lower energy (that is, 
n(a x ) > //(c)), otherwise perform the flip with probability fi(a x )/ n(a). 

(ii) Heat-bath: erase o~(x) and replace it with a sample from the conditional 
distribution given the spins at its neighboring sites. 

It is easy to verify that the above chains are indeed ergodic and reversible 
with respect to the Gibbs distribution [i. 

The mixing time of Glauber dynamics has been extensively studied (see 



£2.5 discussing the related relaxation time gap 1 and the references therein), 



yet as for cutoff, until recently the only setting where it was shown was heat- 



bath Glauber dynamics for Ising on the complete graph 15 26 . There, the 
mean- field geometry reduced the problem to the analysis of a Id birth- 
and-death chain (the sum-of-spins) which governs the mixing of the entire 
system. While this further motivated the conjecture of Peres on cutoff for 
the Ising model on lattices (see |26[|27|), it failed to provide insight for the 



dynamics on (Z/nZ) even in the well-understood case of d = 1. 



In the companion paper 30 the authors verified the conjectured cutoff for 
Glauber dynamics for the Ising model on the d-dimensional torus (Z/nZ) rf 
for any d > 1 and throughout the strong spatial mixing regime, which for Z 2 
extends all the way to the critical inverse-temperature fi c = \ log(l + \/2). 
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2.5. Spectral gap and the logarithmic-Sobolev constant. The fol- 
lowing quantities, denned next directly for Glauber dynamics for simplicity, 
provide useful analytic methods for bounding the mixing time of a chain. 
The spectral gap and log-Sobolev constant of the continuous-time Glauber 



dynamics are given by the following Dirichlet form (see, e.g., 34, 39 ): 



gap 



inf ■ 
/ Var(/) 



i / f Ent( / ) 



(2.4) 



where the infimum is over all nonconstant / G L 2 (fi) and 



<*(/) 



a.x 



Ent(/)=E[/ 2 (a)log(/V)/E/V))] 



It is well known (see e.g. [3|[TTj ) that for any finite ergodic reversible Markov 
chain < 2a s < gap and gap -1 < t MIX (l/2e). In our case, since the sites 
are updated via rate-one independent Poisson clocks, we also have gap < 1. 

By bounding the log-Sobolev constant one may obtain remarkably sharp 
upper bounds not only for the total-variation mixing-time but also for the 
L 2 -mixing (cf., e.g., [9-12,39). The following theorem of Diaconis and 
Saloff-Coste |11[ Theorem 3.7] (its form as follows appears in |39, Theorem 



2.2.5], also see (3l Chapter 8]) demonstrates this powerful method. 

Theorem 2.1. Let (If) be a finite reversible continuous-time Markov chain 
with stationary distribution ir. Let s > and for some initial state yo set 



where \a\ 



t = (4a s )" 
aVO. Then 



loglog(l/vr(y )) I +gap 1 s. 



yo 



7T 



< exp(l — s) . 



Starting from the late 1970's, a series of seminal papers by Aizenman, 
Dobrushin, Holley, Shlosman, Stroock et al. (see, e.g., (T}[l7}[2l]-[23}[28}[29} 
36-38 41-43 46,471) has developed the theory of the convergence rate of 



Glauber dynamics to stationarity at the high temperature regime. For more 
details on these works the reader is referred to the companion paper |30| as 
, while in what follows we describe the result of Martinelli 



35 



37 on the log-Sobolev constant, which is essential to our 



well as to (31 , 34 
and Olivieri |36 
proofs. 

2.6. Strong spatial mixing and logarithmic-Sobolev inequalities. 

Bounds on the log-Sobolev constant of the Glauber dynamics for the Ising 
model (as well as other spin systems) were proved under a variety of increas- 
ingly general spatial mixing conditions. We will work under the assumption 
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of strong spatial mixing (or regular complete analyticity) introduced by 
Martinelli and Olivieri |36| as it holds for the largest known range of (3. 



Definition 2.2. For a set AcZ d and constants C, c > we say that the 
property SM(A, C, c) holds if for any A C A, 



sup 



MaIa 



Ma |a 



< (7e _cdist ^' A - ) , 



where the supremum is over all y £ d A and r 6 {±1} SA and where h\\a is 
the projection of the measure ii\ onto A. We say that strong spatial mixing 
holds for the Ising model with inverse temperature (3 and external field h on 
Z d if there exist C, c > such that SM(A, C, c) holds for any cube A. 

The above definition implies uniqueness of the Gibbs measure on the 
infinite lattice. Moreover, strong spatial mixing holds for all temperatures 
when d = 1 and for d = 2 it holds whenever h ^ or /3 < (5 C . As discussed 
in the introduction, this condition further implies a uniform lower bound 
on the log-Sobolev constant of the Glauber dynamics on cubes under any 
boundary condition r (see 34,36,37]). The proof of Theorem [6] will make 
use of the next generalization of this result to periodic boundary conditions, 
i.e. the dynamics on the torus, obtained by following the original arguments 
as given in |34| with minor alterations (see also (5j[l9[[29[[35j ) . 

Theorem 2.3. Suppose that the inverse-temperature (3 and external field h 
are such that the Ising model on 7j d has strong spatial mixing. Then there 
exists a constant a s = a s (f3,h) > such that the Glauber dynamics for 
the Ising model on {'L/nL) d with any combination of arbitrary and periodic 
boundary conditions has a log-Sobolev constant at least a s independent of n. 

3. Mixing in L 1 and L 2 

3.1. Proof of Proposition [7[ To simplify the notation throughout the 
proof we write v = F(X t G •) and z/j = P(A] G •) for i = 1, . . . ,n. 

We will first show that \\v — 7r|| < \/9Jtf Note that by Cauchy-Schwarz, 



1 



\V — 7T 



1 



and at the same time, letting denote expectation w.r.t. 7r, we have 



\V — 7T 



E„ 



Since v and ir are both product-measures, 

2 



V 


2 


V 


2 


- - 1 






- 1 


7T 


7T 





=i 



n i+ 



Vi - TTi 



L 2 {-Ki) 



t) , 



i=l 
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vr|| T v < -\^exp(Ttt) - 1 . 



It is now clear that |^ — 7r|| TV — > as 9Jtj — > 0, though a more useful relation 
between these quantities is the following. When Tit > 1 put || V ^|| TV 

< 1 

trivially by the definition of total- variation distance, while elsewhere one has 
\\Je x — 1 < \fx for x € [0, 1]. Hence, 

||i>-7t||tv < V^- (3-1) 



Next, we assume that (1.2) holds towards deriving the full form of (1.3). 
Let U\ , . . . , U n denote independent random variables drawn from 7Ti , . . . , ir n 
respectively and define 



Y t = Yi(t) = Ui(Ui)/iri(Ui) 
Clearly by definition EK; = ^2 X V{ (x) = 1 whereas 

Ui(x) 



(3.2) 



Var(Y) = £ 



1 



TTiiX 



TTilX 



Wi - 7T, 



Moreover, the hypothesis (1.2) implies that \\Yi — l\\oo < e for all 1 < i < n. 
In particular, 



E|Y - 1| 3 < \\Y - l||ooVar(Yi) < eVar(Y) . 

Furthermore, noting that Yt > with probability 1 define Z% = logYi. By 
considering the Taylor series expansion of Z% we obtain that 

EZ t = E(Y - 1) - ±E(Yi - l) 2 + O (E\Yi - 1| 3 ) = _i±^M Var(Y) 
and similarly 

EZf = E(Y - l) 2 + O (E\Y - 1| 3 ) = (1 + 0(e)) Var(Y<) , 

where here and through the remainder of this proof the implicit constant 
in the 0(-)-notation is absolute. The random variables Z{ are independent 
0(e) by (1.2), hence the Berry-Esseen Theorem implies that 

0(e) 



and \\Zi 



sup 

X 



< x 



<f>(x] 



< 



Recalling definition (1.1) and that Var(Yj) < e due to (1.2), we get that 

n n 

^EZ, = -(I + 0(e))Wl t , Var(Zi) = (1 + 0(e))Wl t , 

i=l 8=1 

and altogether we find that for a sufficiently small e > the distribution of 
%i becomes arbitrarily close to that of a Gaussian Af(— gSPtj, SDTt) ■ 
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To relate this variable to the L 1 -distance of u from it observe that 
V-vr|| TV = ^2 \v{xi,...,x n )-ir(x 1 ,...,x n )\~ 



e n 



XI, 



\ Ki{Xi) 



1 



1=1 



E 



E 



exp 



1 

n 



i=i 



i 



(*+j^t) 2 

)e 2a5! * cte 



where |a|~ denotes max{— a, 0}. Using the above argument we can now ex- 
press the total- variation distance via a log-normal random variable, namely 
for every 5 > one can choose e > small enough such that 

| ||i/-7r|| TV -E|W- 1|" I < S, where log W ~ A^-^*, M t ) . 

The proof is concluded by noting that 

1 f 

E |W — II - = ; / (1 - e x )€ 

= P(JV(-^S« t ,9Jlt) < 0) -P(jV(i9nt,3Kt) < 0) 

= 2$( v / my2) - 1 , 

as required. ■ 
Using a standard Riesz-Thorin interpolation bound for the L°°-distance 
from equilibrium of a reversible Markov chain at time t > via the analo- 
gous L 2 -distance at time t/2 we can now write down the following corollary 
specializing the above proposition to the case of a product of i.i.d. chains. 

(n) 

Corollary 3.1. Let X t = denote a sequence of product chains with 

stationary measures n = n^ n \ where the n-th chain in the sequence is made 

(n) 

ofn i.i.d. copies of some finite ergodic Markov chain Yt = Y t . Let ip be the 
stationary measure ofY t , put <£> mm = min x ip(x) and let a s and gap denote 
the log-Sobolev constant and spectral gap of'Y t , respectively. If 

o rmin — 

then Xt has cutoff at jj^logn with window 0(ga.p^ 1 +a~ 1 \loglog(p^ n \ + ). 
Proof. Set t* = 2^ log n and define the following time interval for 7 > 0: 
t~ =t"( 7 ) =^-7 gap" 1 , 

t+ = i+( 7 ) = f* + 7 gap" 1 + (40s)- 1 1 log log ^| + , 

while noting that for any fixed 7 > we have t~ = (1 — o(l))t* and in 
addition t + = (1 + o(l))i* thanks to the hypothesis log (p m ] n = n o{ - a ^^\ 
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Following the notation of Proposition [?[ each X\ is an independent copy 
of If, and we start by verifying the condition ( |1.2[ ). This will follow from the 
well-known fact (see, e.g., |39| Eq. (2.4.7)]) that if Yj is a reversible Markov 
chain with stationary distribution <p then for any t > one has 

\\¥(Y t G •) - <p\\ Loa{v) < ||P(F t/2 G-)- p\\h(<p) ■ 



Combining the hypothesis logy? m [ n = n°( Qs /s a P) with Theorem 
follows that for any t x gap -1 logn we have 

max||P(Y* G •) " <p\\m<p) < exp (l - (1 - o(l))tgap) 

yo y ' 

We conclude from the last two inequalities that for any t > t~ 

max \\F(Y t G •) - (p\\ L «> {ip) < n^ 1 ^ 1 ) = o(l) , 
yo y ' 



2.1 



it now 



thus satisfying the prerequisite (1.2) for any e > 0. Consequently, by (1.3) 



\F(X t G •) - vr| TV = 2$( v / S^/2) - 1 + o(l) 



and it is left to evaluate 9Jti = nmax So ||P yo (Yi G •) — ^p\\ L 2^ at t~ and t + . 

For t~, the standard lower bound on L 2 -distance via the spectral gap 
(cf., e.g., [127]) implies that max^ \\F(Yt S •) — pWl 2 ^) — ex P( — *g a P)) thus 
£flt- > 6xp(7), and so |P(X t - G •) — 7r| TV — > 1 as 7 — > 00. 



For t + , another application of the log-Sobolev inequality in Theorem 2.1 
implies that 9Jt t + < exp(2 — 27), thus |P(X t + G •) — 7r| TV — )• as 7 — )• 00. 
Altogether, cutoff occurs at t* with window 0(t + —t~), as required. ■ 



Remark. Corollary 3.1 gives insight to the behavior of the Ising model on 
the torus (Z/nZ) a! . Consider the (much simplified) model where the torus is 
partitioned into boxes of side-length, say, m = logn, and each box evolves 
independently with its own periodic boundary conditions (all long range 
interactions are completely absent from this model). Observe the following: 

• The dynamics is a product of N = (n/m 

\d = n d-o(i) u 

.d. copies of 

Glauber dynamics for the Ising model on (Z/mZ) d , denoted by (It). 

• Let (p denote the stationary measure of (Yt) and note that its state 
space contains 2 md configurations and so log 99"^ = 0(m d ) = n ^. 

• The log-Sobolev constant of (Yt) is known to be uniformly bounded 
away from 0, hence in particular ^ogf~^ = A r °( cts/ ' sap ). 



Appealing to Corollary 3.1 we conclude that there is cutoff at (2A m ) _1 logn 
with window O(loglogn), where A m is the gap of the dynamics on (Z/mZ) rf . 
Given that X m converges to the spectral-gap of the infinite- volume dynamics 
as m — > 00 (as was indeed shown in [30|), this picture of cutoff coincides 
with the actual behavior of the Ising model on the torus proved in [30] . 
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Note that, as the Ising model does of course feature long range interactions 
(albeit weak ones at high temperatures), a significant part of the proof will 
entail controlling these interactions to enable an application of Proposition [7j 

3.2. Supports of random maps. In what follows we place Lemma 3.8 



of 30 in a general context. Recall that if if is a transition kernel of a 
finite Markov chain then a random mapping representation for K is a pair 
(g, W) where g is a deterministic map and W is a random variable such 
that F(g(x, W) = y) = K(x, y) for all x, y in the state space of K. It is 
well-known (and easy to see) that such a representation always exists. 

Definition 3.2 {Support of a random mapping representation) . Let K be & 
Markov chain on a state space T, v for some finite sets £ and V. Let (g, W) 
be a random mapping representation for K. The support corresponding to 
g for a given value of W is the minimum subset Aw C V such that g(-, W) 
is determined by x(Aw) for any x, i.e., 

g(x, W) = f w (x(A w )) for some f w : S Aw -> S y and all x. 

That is, w E Aw if and only if there exist some x, x' E Tj V differing only at 
coordinate v such that g(x, W) ^ g(x' , W). 

Lemma 3.3. Let K be a finite Markov chain and let (g, W) be a random 
mapping representation for it. Denote by A\y the support ofW w.r.t. g as 
per Definition \3.S\ Then for any distributions (p,ip on the state space of K, 

\\<pK-^K\\ TV < j \\y\ hw -i>\ Aw \\ TV d¥(W). 

Proof. Let ft denote the state space of K. Following the definition of 
the support set A w , let fw denote the deterministic map which satisfies 
g(x,W) = fw{x{A w )) for all ie!l. 

By definition of total-variation distance, 

\\<pK - VK|| TV = max [(<pK)<T) - &K)(F)} . 

Let X, Y be random variables distributed according to (p, ip respectively. 
The above is then equal to 

max j [F(g(X,W) eT) - F(g(Y,W) £ V)] dF(W) 

< J max [F(fw(X(A w )) E V) -F(f w (Y(A w )) £ r)] dF(W) 

< J \\F(X(A W ) E -)-¥(Y(A w ) E -)|| Tv dF(W) 
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where the second inequality used the fact that when taking a projection of 
two measures their total-variation can only decrease. ■ 

4. Cutoff for on arbitrary graphs via log-Sobolev inequalities 

4.1. Sparse supports. The main step in proving Theorem [l] would be to 
break up the dependencies between the spins and express the total-variation 
distance of the dynamics from equilibrium in terms of its projection on a col- 
lection of well-separated small clusters of spins, which we refer to as a sparse 
set of spins. Thereafter the proof would proceed by showing that these clus- 
ters are essentially independent, justifying an application of Proposition [7] 
to reduce Z^-mixing to L 2 -mixing for this projection. We first define the 
notion of a sparse set. Throughout the proof, let V denote the vertex set of 
G and let E denote its set of edges. 

Definition 4.1 [Sparse set). We say the set A C V is sparse if it can be 
partitioned into (not necessarily connected) components {Ai} so that 

1. Each Ai contains at most ^logn vertices. 

2. The diameter of every Ai in G is at most | log 2 n. 

3. The distance in G between any distinct Ai,Aj is at least 25AS~ 1 logn. 

Let S = S(G) = {A C V : A is sparse}. 

We can now state the main result of this subsection, which provides an 
upper bound on the L 1 -distance of the dynamics from equilibrium in terms 
of the corresponding quantity for its projection onto a sparse set of sites. 

Theorem 4.2. Let (Xt) be the Glauber dynamics on G and let fx be its 
stationary measure. Let 7Aa~ 2 logp < s < 2a s r 1 logn and t > 0. Then 
there exists some distribution v on S such that for large enough n 

\\Pa (X t+s G •) - /x|| TV < J HP^X^A) 6 •) - MklU <M A ) + 3n ~ 10 » 

and moreover : v G A}) < p~ 10 for every vertex v G V. 

The remainder of this subsection will be devoted to the proof of the above 
theorem, the first ingredient of which would be to consider a close variant 
of the dynamics which effectively turns distant sites into independent: We 
refer to this as the Barrier dynamics, as it is achieved roughly by fixing 
the boundary on the perimeter of a ball centered at each vertex (forming a 
barrier on the propagation of information). 

Recall that B v {i) = {w G V : dist(u,w) < £} denotes the ball of radius 
r about the vertex u, and similarly let dB v {i) denote the boundary of this 
ball, i.e. the subset {w G V : dist(v,w) = £}. For a subset A C V define 
B A (£) and dB A (£) analogously, e.g. B A {£) = U veA B v (£). 
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Throughout this section let 

r = [lOAS^lognJ , (4.1) 

where S s was defined in Theorem [I] to be the minimum of the log-Sobolev 
constants of the Glauber dynamics over all induced subgraphs of G with 
diameter at most log 2 n. Note that by definition p = max„ g y l^( r )l- 

Definition 4.3 [Barrier- dynamics). Let (Xt) be the Glauber dynamics for 
the Ising model on G. Define the corresponding barrier- dynamics as the 
following coupled Markov chain: 

(1) For each vertex u, let 0, u = {±l} Br ( u ). The state set of the barrier 
dynamics is the cartesian product of {f2 M : u 6 V} and its marginal 
on £l u is the Ising model on B u (r) with free boundary (to be thought 
of as if a barrier disconnected B u (r) from the remainder of G). 

(2) The coupling of the dynamics with the standard Glauber dynamics 
(Xt) is defined as follows: The initial configuration of each £l u is 
obtained by projection the initial configuration of X onto B u (r) 
for each u. Whenever the Glauber dynamics updates a site v via a 
variable I ~ U[0, 1] in the standard dynamics (occurring according to 
an independent unit-rate Poisson clock), each Q u such that v S B u (r) 
updates its copy of v via the same update variable I. 

For any s > 0, the barrier dynamics gives rise to a randomized operator 
Q s on {±1}^ obtained by projecting its configuration onto the centers of the 
balls B u (r). That is, in order to compute Q s (a), run the barrier dynamics for 
time s starting from the initial configuration formed from a in the obvious 
manner and denote its final configuration by {a u £ Q u : u £ V} . The output 
of Q s (o~) assigns each u G V its value in a u . 

Lemma 4.4. Let to = 2 a" 1 logra. For any sufficiently large n, the barrier- 
dynamics and original Glauber dynamics are coupled up to time to except 
with probability n~ 10 . That is, except with probability n~ 10 , for any Xq we 
have X s = Q s (Xq) simultaneously for all s < to- 

Proof. Let (Xt) denote the Glauber dynamics on G and let (Xt) denote the 
barrier-dynamics coupled to (Xt) as defined above. Clearly, for any I < r 
and v £ V we have that X s = X s on B v (l) as long as none of the sites 
comprising dB v (l) were updated by time s. Therefore, if we assume that 
Xt(v) Xt(v) for some v S V and s < to then there necessarily must exist 
a path of adjacent sites ui,...,U£ connecting v to dB v (r) and a sequence 
of times t\ < . . . < tg < s such that site iij was updated at time U (and as 
such I > r). Summing over all A £ possible paths originating from v, and 
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accounting for the probability that the t corresponding unit-rate Poisson 
clocks fire sequentially before time s < to it then follows that 

p( U {Xt(v) ?X t (v)}) <n^A e F(Po(t )>£) < 2ne" tf '^^ 

^ueV ' Or 



r\ 



where the last inequality followed from the fact that r > 2Ato- Moreover, 
since r = (5 + o(l))Ai we further have (At ) r /V! = (e/5)( 1+0 W) r . Using the 
facts e/5 < e -3 / 5 and r > (20 + o(l)) logn (recall that A > 2 and S s < 1) 
we can now infer the required result (with room to spare) for large n. ■ 

We next define the notion of an update support, specializing the notion 
of the support of the random mapping representation from Definition 3.2 to 
the barrier-dynamics map. 

Definition 4.5 (Update support). Let W s be an update sequence for the 
barrier-dynamics between times (0, s). The support of W s is the minimum 
subset A\v a C V such that Q s (x) is a function of x(Aw a ) for any x, i.e., 

9w s (x) = f Ws {x{A Wa )) for some f Ws : {±1} A ^ {±1}^ and all x. 

For any given W s its support Aw a is uniquely defined. 

The following lemma demonstrates the merit of considering the update 
support. 

Lemma 4.6. Let (Xt) be the Glauber dynamics on G, set to = 2 a" 1 logn 
and let W s be the random update sequence for the barrier- dynamics along 
an interval (0, s) for some s < to- For large enough n and any o"o and t > 0, 



■„ (X t+a E •) - /i|| TV < J \\¥ ao {X t {A Ws ) E •) - mUvk s || tv dP(W s ) + 2n- 



10 



Proof. Let (Xj) be Glauber dynamics at time t started from Xq = ctq and 
let Y E ^ be distributed according to By considering the random map 
Q s as a discrete Markov chain on f2, an application of Lemma 3.3 gives that 

wnosixt) g o-m^) g oii T v < Jwhm^w.) g •) -mu^ju^w,). 

Since s < to, two applications of Lemma 4.4 will now couple Xt+ S with 
Gs(Xt) and similarly £7s(^) with Glauber dynamics run from Y for time s 
(having the stationary distribution fj,) except with probability n -10 : 



\\F(X t+a e-)--p(g 8 (X t )€-)\\ TV <n 



-10 



||P(^(r)G-)-^ll TV <n- 10 . 
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Combining these estimates, it follows that 

\\F(x t+s e •) - mLv < \\nOs(x t ) e •) - F(g s (Y) g .)IU + 2n ~ w 



< 



J \\F(X t (A Ws ) G •) - m|a Ws || tv dP(W s ) + 2n- w , 



as required. 



Lemma 4.7. Lei <7 S 6e i/ie barrier- dynamics operator, let W s be its update 
sequence up to time s for some s > 7Aa~ 2 log p, and let S be the collection 
of sparse sets of G. For sufficiently large n we have P(Aw s G 5) > 1 — n -10 
and furthermore P(v G Aw s ) < p 10 for every v £ V. 

Proof. Let U v for v G V denote the event that the outputs of the barrier- 
dynamics operator Q s computed on the all-plus and all-minus inputs disagree 
on the value of v. Observe that if Q s does agree on v under the two extreme 
inputs of all-plus and all-minus then by monotonicity the initial configura- 
tion of B v (r) has no influence over this spin, and in particular, 

A Ws C |J {B v (r) : v G V such that U v holds} = A Ws . (4.2) 

Since the family of sparse sets monotone-decreasing, it clearly suffices to 
establish both statements of the lemma for the super-set A^ s governed by 
the indicators {!{/„ : v G V}. 

To estimate the probability of the event U v let (X^~) and (Xf) be two 
instances of the barrier-dynamics projected to B v (r) and starting from the 
all-plus and all-minus states respectively, coupled to each another via the 
monotone coupling. These projections are simply the standard Glauber 
dynamics on B v (r), hence letting it = pr B , we get 

F(U V ) = F(X+(v) + X~(v)) < ||P(X+ G •) - vrllxv + ||P(^ G ■) - vr|| TV 
< |||P(X+ G •) " Av*{*) + U F (Xs G •) - • 



Due to Theorem |2.1[ if the all-plus state 1 has stationary measure at most 
e _1 (clearly the case for large n since |S„(r)| > r — > oo and the interactions 
are finite) then for any s > 

p(1 * + 6 ■> - HL, 5 ex » (' - gap ( s - £ loglog ^y) ) • (43) 

where A and a s are the spectral gap and log-Sobolev constant resp. of the 
Glauber dynamics on B v (r). Recalling that trivially vr(l_) > 2~^ Bv ^, by 
definition of p we have loglog(l/7r(l)) < logp, and since gap > a s > a s the 
assumption on s easily gives 

s - log log — ^ > (7AS- 1 - I) gap" 1 log p , 
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hence for large n we can absorb the pre- factor of e in the r.h.s. of (4.3) and 
obtain (with room to spare) that 

||p(1+g-)-^IIl 2w <^ 6A/Ss . 

By the exact same argument we also have ||P(X~ G •) — ^\\l 2 (tt) — p~ 6A / Qs 
and it now follows that 

V(U V ) < /T 6A / Ss . (4.4) 



Going back to the definition of Aw s in (4.2 ) we can infer that for any v € V, 

where the last inequality (which used the fact that A > 2) implies the sought 
upper bound on F(v G A-w s ). 

We now wish to show that P(Ajy s £ S) > 1 — n~ 10 . To see this, denote 
by 5 the event that there exists some sequence S of £ = [logn] points, 
S = (vx,v<2, . . . ,vg), satisfying the following: 

1. For every i we have 2r < distc ({^i, • • • , Vi}, Wj+i) < 5r. 

2. For every i the event U v . holds. 

Notice that for any sequence S satisfying Item [TJ given v\, . . . ,Vj there are 
at most J2i<j l-^i(5 r )l < jp 5 choices for the vertex Vj + ±. Furthermore, the 
balls {B Vi (r)} are pairwise disjoint and as a consequence the events {U Vi } 
are mutually independent. As ¥(U Vi ) < p~ 6A / Qs < p~ 12 for all i we now get 

P(S) < n{£ - ljjp 8 ^- 1 )- 1 " < n (ep- 7 ) 1 < np~ U < n 1 " 610 ^ < n - loglogn 

for large enough n, as in that case one has p>r> [log n] . 

Next, let S' denote the event that there exists some sequence S' of 

a s logn 
A log p 

points, S' = V2, ■ ■ ■ , Vg), satisfying the following: 
1'. For every i we have 2r < distg (vi,vt+i) < 5r. 
2'. For every i the event U Vi holds. 

We now argue as above: For any sequence S satisfying Item[TJ, given Vj there 
are at most |£?„,.(5r)| < p 5 choices for the vertex Vj+i, and as before the 
balls {B Vi (r)} are pairwise disjoint and so the events {U Vi } are independent. 
Plugging in the fact that ¥(U V .) < p~ e ^/ as for all i now yields 

P(E') < npW-Qp- 6 ^ 1 ' < np~ 3AS s le = np-^ o{1)) ^ = n' 11 ^ . 

Altogether, for sufficiently large n we have P(HUH') < n -10 . Conditioned 
on the fact that neither H nor E' occurred, and consider the partition of Ayy a 
to components {A{\ obtained by repeatedly applying the rule whereby one 
identifies the components of B u (r) and B v (r) if dist^ii, u) < 5r. Clearly, 
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if R = {v £ V : U v holds} then each Ai can contain at most p 2 £ vertices 
of R, otherwise one could greedily find in Ai a sequence S satisfying the 
conditions specified in the event S. Since A\y 3 = \J v&R B v (r) we get that 

\A\ < P 3 t' < P 3 logn for all i . 

Similarly, for every u,v £ R that belong to the same Ai we must have 
dist<3(«, v) < 2r£' or else one could find in A4 a sequence S' satisfying the 
conditions specified in the event S'. Thus, taking into account the inclusion 
of an extra B v {r) for all v £ R when forming Aw s we infer that for all i 

diam G (^i) < Irt' + 2r < 80 + i og 2 n < I \ og 2 n 

logp 2 

where the last inequality holds for large enough n as p diverges with n. 
Finally, if u, v E R belong to distinct components Ai,Aj then by definition 
dist(5(w, v) > 5r. Thus, accounting for the addition of {B u {r) : u G R} when 
forming A^ s gives that for all i 7^ j and sufficiently large n, 

dist G {Ai,Aj) > 3r > 25AS E T 1 logn. 

We conclude that F(A^y s S S) > 1 — n -10 , as required. ■ 



The statement of Theorem |4.2| is an immediate corollary of Lemmas 4.6 
and[4/FJ obtained by setting u(k) = F(W S : A Wa = A) for A € S. 

4.2. Convergence in L 1 of the dynamics projected on sparse sets. 

So far we showed an upper bound on the ^-distance of the dynamics from 
equilibrium in terms of its projection onto a sparse set of sites. The goal of 
this subsection is to provide the following estimate for the latter quantity: 

Theorem 4.8. Let A £ S and let A±, . . . , Al be its partition to components 



as per Definition 1^.1 , Let (XI) be the Glauber dynamics on Aj~ = B^ir) 



with free boundary and the original interactions and external field, and let 

"ther define 

I II 2 



7Tj = fi® A+ denote its stationary distribution. Further define 
m t (Ai,ao) = \\F ao (Xi(Ai) e ■)-**' 



TXt(A,a ) = J2m t (Ai,a ) . 

i 

Then for any o"o and j logn < t < to, the dynamics (Xt) on G satisfies 

\K (X t (A) £ ■) — mUIU = 2<D( v / 9H i (A,a )/2) - 1 + o(l) , 
where the o(l)-term tends to as n — > 00. 

The following lemma will allow us to relate the projection of the dynam- 
ics onto a sparse set A £ S to the product-chain of the dynamics on the 
component partition corresponding to A. 
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Lemma 4.9. Let A G S and let At, . . . ,Al be its partition to components 



as per Definition J^.l, Set Af = Bj^ir), let (X%) be the product chain of 
Glauber dynamics on the graphs induced on Af, . . . , Af independently (with 
free boundary and the original interactions and external field), and denote 
its stationary distribution by ir = Y\i ■ Then for any o~q and t < to, 



F ao (X t (A) G -J-mIaIU " IK*PQ*(A) E •) - tt| a | 
where = ao(UiAf). 



TV 



< ri 



-2/3 



Proof of lemma. Let (Xt) be the original Glauber dynamics (Xt) for the 
graph G and consider its standard coupling to the dynamics (X*), that is, 
run the two chains via the same unit-variables and Poisson clocks for the 
site updates. Crucially, the pairwise distances between the components Ai 
are all strictly larger than 2r and so the sets Af are pairwise disjoint, thus 
this coupling indeed retains the independence between the coordinates of the 



chain X%. Moreover, the proof of Lemma 4.4 implies that the projections 
onto Ai, . . . , Al are maintained identical in this coupling until time to except 
with probability at most n~ 9 (the difference from the original estimate of 
n -io ^ n that lemma is due to a union bound over the L < n components). 
That is, with probability at least 1 — n -9 

X t (A) = X t *(A) for all t G [0,t ] , 

and in particular, letting Ug be the configuration derived from a® in the 
obvious manner we get that for any t < t$, 

\K (X t (A) G •) — Po-*(X t *(A) G .)IItv < n-\ (4.5) 

To relate /x|a to 7t|a we first argue that Xt is well- mixed under total- 
variation distance. Indeed, if (Xf~) and (Xf) are instances of the dynamics 
starting from the all-plus and all-minus states resp. then for any v G V, 



(X t » + X t -(v)) < F (x+(v) + X t -(v)) + n 



-10 

where (Xt) denotes the Glauber dynamics on the graph B v (r) with in- 
teractions and external field inherited from G, and the additive term of 
n~ 10 is justified by the same coupling argument given above. The event 
{X^(v) / X^(v)} was considered in Lemma 4.7 there denoted by U v , and 
exactly the calculation that led Eq. (4.4) valid for time s > 5AS~ 2 log/0 now 



gives that 

F (x+(v) ^ Xi o (v)) <n-^. 
Taking a union bound over the vertices it now follows that 
F(Xt^X^)<n-^ + n^ = 0{n-^) ) 
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and in particular 

max \\K (X t0 E •) - m||tv = 0(n" 3 / 4 ) . (4.6) 



Co 



Let Xq be a configuration on UiAf distributed according to it and extend it 
arbitrarily to a configuration Xq on G. As before, with probability at least 
1 — n~ 9 we can couple (X t ), (X%) so that they would agree on A = UjAj 
throughout the time interval [0,to]- Since X 4 * (A) ~ 7t|a and total- variation 
distance can only decrease on a marginal we can infer from (4.6) that 

IWa-mIa||tv= l|Px*(x*(A) e O-mUIItv 

< \\F Xo (X to (A) E •) - p\ A \\ TV + n~ 9 

< \\F Xo (X to E •) — HItv + n~ 9 = 0(n~ 3 / 4 ) . 



Combining this with (4.5) via the triangle inequality implies the statement 
of the lemma with the estimate <3(n -3 / 4 ) < n -2 / 3 for sufficiently large n. ■ 



In the course of the proof above we obtained in (4.6) that for any 5 > 0, 

tuix(S) <t = 2a~ 1 logn. (4.7) 

As a consequence of this, we claim that in our proof of Theorem [T] we may 
assume without loss of generality that for large enough n we have 

p<n 1/w , and A/a s <^logn. (4.8) 

Indeed, recall that our aim is to show that t m x(S)—t MSX (l—S) < 16AS~ 2 log p 
for any fixed 5 > 0, and observe that to this end we need only consider the 
case where t mx (S) > 16AS s " 2 logp for small enough 5 > as otherwise the 
sought statement trivially holds. Combining this with the aforementioned 
upper bound on t mx (S) gives 

AS^logp^ ilogn, (4.9) 

producing the above bounds on p (as A > 2) and A/S s (as p — > oo with n). 

Armed with these estimates as well as Lemma 4.9 we now turn to prove 
Theorem EDO 



Proof of Theorem 14.81 By Lemma 4.9 it suffices to show that the product 
chain (X*) with coordinates {X\) and stationary measure ir = Yl^i satisfies 



||P CT * (A7(A) E •) - 7t|a|| tv = 2*(yaK t (A J a*)/2) - 1 + o(l) (4.10) 

for any sparse set A, and initial configuration cjq and any | logn < t < to- 

Consider the chain (XI) for some 1 < i < L, run from an arbitrary initial 
configuration <tq till time 

t > sq = 4S" 1 logp . 
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It is easily seen that for any two measures p, ip on a state space V and a 
map / : V — > T' , the L°°(tt) distance of p from tp can only decrease when 
taking the marginal of the measures on T' since 



max 



i 



< max > — - , . , , = \\(p ■ 



Specialized to our setting we get that 

||P CT0 (^(A) e ■)-n\A i \\ L ^ i]Ai) < \\K (Xl e O-Till^, 



< 



( x l/2 £-)-m 



■ o"o \ yv t/2 



where the last transition incorporated a standard reduction from L°° to L* 



(see e.g. [39 1). Further recall that by Theorem 2.1 



< exp ( 1 — gap 



1 1 i , 1 

O ~ ~A lo g lo g— 7 \ 

2 4a s vrj(o-o) 



where A and a s are the spectral gap and log-Sobolev constant resp. of the 
chain (XI). We can control the diameter of Af as follows: 

diamc(^) < 2r + diamc(Aj) < 20^- logn + ^ log 2 n < log 2 n , 



where the last inequality was thanks to (4.8). As a consequence, we have 
a s > a s by definition of a s . Moreover, since 

\Ai\ < P 3 log n we have 



log log 



1 



= (l + (l))log|A+|<(l + o(l))log(p|^|) 
< (1 + o(l)) logGo 4 logn) < (5 + o(l))logp, 



(here we used the fact that p > log n) and the fact t > sq now implies that 



2 4a. 



log log 



7Ti(cr ) 



> 



o(l))gap 1 logp. 



hence for sufficiently large n we get 

||P CT0 (^(A) e •) - tt.U, || ioo(7riUj) < p^l 2+ °^ = o{l) • (4.11) 

By (4.9) we have 4a~ 1 logp < ^logn < \\ogn (the last inequality due 
to the fact that A > 2), and so in our setting indeed t > \\ogn > sq. 
We have thus established that every (Xj) fulfills the requirement (1.2) of 
Proposition [7] for arbitrarily small e > 0. The application of this proposition 
now establishes (4.10), completing the proof. ■ 
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4.3. Proof of Theorem [TJ We begin the proof by formulating an expres- 
sion for the cutoff location. Let SDTt be as in Theorem |4.8| and set 



t* = mf<t>0 : maxmax9JWA,o- ) < p > . (4.12) 
[ AeS CTO J 

Observe that 9Jtt(A, cxo) is continuous and monotone decreasing in t. A useful 
lower bound on these quantities is the following: 

maxmax9Jt t (A, a ) > np~ 3 e~ 2t for any t > 0. (4.13) 

AeS <t 

To see this, let A consist of an arbitrary set of np~ 3 singletons whose pair- 
wise distances all exceed 3r, the existence of which is guaranteed by the 
definition of p. It therefore suffices to show that for each v £ A we have 
Wl t ({v}, (To) > exp(— 2t) for an appropriately chosen Co G {±l} Bv ( r \ Write 
■0 = f/r g ( r \\{ v }j assume without loss of generality that 0(— 1) > | and let 
o"o be the all- plus starting configuration. Write ipt = F ao (X^(v) £ •) and 
note that the monotonicity of the Glauber dynamics implies that (ft stochas- 
tically dominates if) for any t > 0, and furthermore this holds even when 
conditioning on the sequence of updates sites up to time t (yet without re- 
vealing the unit- variables used for generating the new spins). As such, if 
t v = inf{t : XI updates the site v} then for any t > 

> P(r„ >t) + V(l)P(r„ <t) = 0(1) + e~V(-l) > 0(1) + \^r l , 

with the last inequality due to our assumption on Immediately it 

follows that 

\\<Pt - V'IIl 2 ^) >\\<Pt - ^h 1 ^) = 2 M 1 ) - 0(1)1 > e ~* , 
thus establishing (4.13). Since p < n 1 / 16 thanks to (4.9) we immediately 



observe that at t = we have maxA max^, 9Jti(A) > np 3 > p and hence it 
follows from continuity that 



max max Tit* (A, do) 
AeS (Jo 



Moreover, Eq. (4.13) implies that exp (-2t*) < p 4 /n < n" 3 / 4 (where we 



again used the fact that p < n 1 / 16 ) and after rearranging this yields 

3 1 

t* > -logn > - logn. (4-14) 
8 4 

Conversely, using the log-Sobolev argument as in the proof of Theorem |4.8| 
one obtains that if A is a sparse set and A\, . . . , Al are the components 
comprising its partition then, using the same notation as Theorem 4.8 we 
have 

m&xmt (Ai,a ) < exp [-2 (t + (| + o(l)) logp)] < n" 3 / 2 . 
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As L < n it follows that for t > to one has maxAmaxCT,-, 9Jtt(A, <ro) < n -1 / 2 
whereas at t = t* this quantity should equal p > logn. Altogether we deduce 
that for large enough n 

t* < t = 2S E T 1 logn. (4.15) 

At this point it we can establish that t mx (l — e) > t* for any fixed e > 
and large enough n. Indeed, the above given bounds on t* place it in the 



admissible range for an application of Theorem 4.8, and taking A and ctq to 



be those achieving the maximum in the definition (4.12) we get that 



\\F ao (X t * G •) - HI TV > Fv (Xt*(A) e •) - HaIItv 

= 2$(^UT t (A,a )/2) - 1 - o(l) . 

Since !DV(A, o"o) = p — )• oo we infer that ||P<j (Xi* G •) — p|| TV = 1 — o(l), 
as required. 

It remains to show that t mx (£) < t* + 16A2~ 2 log / o for any fixed e > 
and large enough n. Thanks to the reduction provided by Theorem 4.2, this 
would follow from showing that 

max / ||P CT0 (AV(A) G •) — m|a|| tv dv(A) = (!) ( 4 - 16 ) 
a ° Js 



where v is the distribution on S specified in that theorem. By Theorem 4.8 



and the bounds (4. 14), (4. 15) on t* we can replace the integrand in the above 



equation by 2$ (-^/Wlt* (A, (tq)/2) — 1 — o(l) and hence reduce the task of 



establishing (4.16) to showing that 

max / $(V9MA,<7 )/2) du(A) = J + o(l) . (4.17) 

To this end, let <ro be arbitrary and let Ao be the sparse set that maximizes 
VJlt* (Ao, 0"o). To simplify the exposition, in what follows we will omit the 
reference to ao as an argument of 9Jtj* when there is no danger of confusion. 

It will be useful to define another sparse set A as follows. Begin by setting 
A = 0, then repeatedly add components {Ai} to A via the following rule: 

• Let A±, . . . , Ak-i be the components already collected. 

• Let Ak C V be the subset maximizing TX^iA^) subject to the con- 
straint distc(Aj, Ak) > 3r for alH < k as well as \Ak\ < p 3 logn and 
diam G (A fc ) < \ log 2 n. 

• The process terminates once no such set Ak exists. 

Note that clearly 

9MA)<9MAo)<p ( 4 - 18 ) 
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(the last inequality may be strict since ao was chosen arbitrarily as opposed 
to selecting the configuration for which Tit* could obtain its global opti- 
mum). For each v £ V we associate the following quantity measuring the 
weight of the closest Ai to it: 

6 V = max|mt*( J 4j) : distc(t>, Ai) < 3r j . 

Motivating this definition is the following simple fact: We argue that if A\ 
is a component of some sparse set A S S then 

Wt*(Ai) < max#„ . 

To verify this fact, let v be the vertex with the maximal 8 V among all vertices 
of Ai and let k > 1 be the smallest index such that dist^^, Afc) < 3r. 
(Note that k is well-defined since if no such A^ existed then v itself would 
be admissible as an additional singleton cluster in A, contradiction.) By 
the maximality of v that is to say that k is the smallest index such that 
dist(j4j, Ak) < 3r. Since distc(Aj, Aj) > 3r for all j < k it follows that Ai 
satisfies all the conditions required from Ak+i and thus by definition of A 

m t *(Ai) < mt*(A k+1 ) < m t *(A k ) = 6 V , 

as claimed. Since the A^s are disjoint, this fact implies in particular that 

l{ueA} f° r an Y A € 5 . 

Recall now that the measure v over the collection of sparse sets S satisfies 
z^(A : v £ A) < p~ 10 for any v E V. Plugging this in the above inequality 
we obtain that 

f Tl t *(K)dv{K) < p- 10 ^2e v . (4.19) 

To estimate Y2 V @v recall that each 6 V amounts to Tit* (Ai) for some Ai whose 
distance from v is at most 3r. Thus, each Tlt*(Ai) appears at most |^i|/5 3 
times in this sum, and since by construction \Ai\ < /j 3 logn we obtain that 

°v ^ Yl mt * lo S n = m ** fi)p 7 lo § n ^ m t* ( A o)/° 7 lo § « < p 8 log n , 



V 



where the last two inequalities used (4.18). By combining this with (4.19) 



(and recalling that p > logn) we now infer that 

J Tl t * (A) dv(A) < p~ x = o(l) . 

Markov's inequality now implies that ^({A : Tit* (A) > e}) = o(l) for any 
fixed e > 0. This readily implies Eq. ( 4.17| ) by the continuity of the integrand 



in that equation as a function of Tit* and thus completes the proof. 
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5. General spin system models 
In this section we extend the cutoff criterion, established in the previous 



sections for the Ising model, to general spin systems as defined in { 2.3 
including for instance the Potts, proper coloring and hard-core models. 

While to a large extent the proofs presented thus far were not model 
specific and mainly used bounds on the log-Sobolev constant, we did use 
the monotonicity of the Ising model in an essential way in the context of 
the barrier dynamics operator Q s . Indeed, as defined Q s is a grand coupling, 
a coupling of all initial configurations simultaneously. The monotonicity of 
the Ising model implies that if the chains starting from all-plus and all-minus 
agree at some vertex v then so do those started at any other configuration. 
For general spin-systems which do not have monotonicity we will construct a 
grand coupling and analyze it using ideas similar to those in the construction 
of perfect simulation algorithms (see, e.g., [25)). 

We will consider Markovian grand couplings where the Poisson clocks on 
the vertices are identical for all chains and the updates depend only on the 
current states of each chain and new independent randomness. 

For such a grand coupling, let Xf° (v) denote the copy of the chain started 
from Xq = xq. Further define the disagreement process S£ : M+ x A — > {0, 1} 
with %t(u) denoting the indicator that there exist two initial configurations 
xq,x' G y A such that at time t the corresponding chains disagree at site 
u, that is, X t °{u) ^ X t °(u). By this definition clearly = 1. If for some 
C, c > the coupling satisfies the following exponential coupling condition 

maxP(JT t (u) = 1) < Ce~ ct , (5.1) 

then we attain the following generalization of our framework for the Ising 
model to the case of a general spin-system. 

Theorem 5.1. Fix d > 1 and suppose that there exists a Markovian grand 
coupling for Glauber dynamics for a general spin-system on a box A C Z d of 



side-length n which satisfies (5.1). Then the dynamics exhibits cutoff with a 
window of O (log log n). 

Proof. We describe the modifications required in the proof of Theorem [l] 



to complete the result. First note that the bounds of Lemma 5.2 imply 



order logn mixing under any boundary conditions and so combined with 



Theorem 2.3 of 18 we have strong spatial mixing and hence a uniformly 
bounded log-Sobolev constant [37] . Most of the proof and the preceding 
lemmas can then be taken as unchanged yet in a few places monotonicity 
is exploited. We define the barrier-dynamics in the same way, taking the 
single spin updates according to the grand coupling described above. 
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In Lemma 4.7 monotonicity is used to give an upper bound on the prob- 



ability of all chains agreeing at a vertex v via the probability of the all plus 



and minus agreeing at v. The exponential coupling bound (5.1) gives the 



required bounds in the general setup. Similarly in Lemma 4.8 a point-wise 
monotone coupling bound is used to bound the total variation between two 
measures which can be replaced by the exponential grand coupling bound. 

The proof of Theorem [l] begins with an estimate showing that t* > | log n 
which ensures that the dynamics on the components Ai are well mixed in the 
L°° distance. On the lattice Z d all such components are poly- logarithmic in 
size and so mix within time O(loglogn). We may instead take 



t* = inf < t > (log log n) : maxmax9Jtt(A, <jq) < log log n 
[ Ae-S <jo 

which is sufficiently large to apply the argument in the proof of Theorem [Tj 
and so implies an upper bound on the mixing time of 

tuix(e) < t* + Clog log n, (5.2) 

for large enough C > 0. Results of |20| imply that t Mlx (e) > clogn for 
some c > and so t* > clogn — C log log n. Since we have excluded the 
possibility that t M ix(l ~ e ) < (log log n) 2 for large n it follows from the proof 
of Theorem [I] that t MIX (l — £) > t* for large enough n. Combined with (5.2) 
this establishes cutoff with a window of order log log n. ■ 



5.1. Soft interactions. We now construct grand couplings satisfying (5.1) 
when the temperature is sufficiently high. We will focus on the heat-bath 
dynamics but note that analogous results apply for the Metropolis-Hastings 
chain (as detailed in the remark at the end of this section). 



For a general spin system on a box A C Z J with a state space 5^^ for 



some finite set 5? (see ^2.3 for definitions) define the following measure of 



"temperature" for the system: 



min \x\a{x) = s I <t(A \ {x}) = r](A \ {x}) 



It is easy to see that < < 1. Roughly stated, values of close to 1 
correspond to systems with weak interactions and high temperatures. 

Given £ M one can define a grand coupling for heat-bath Glauber dynamics 
as follows. Recall that a vertex x is chosen to be updated by a rate-one 
Poisson process and with the new state being given as some function of 
<r(A\ {x}) and a random variable U uniform on [0, 1]. When U < ^ we can 
set a(x) independently of the current configuration, selecting state s with 
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probability 

min u[o~(x) = s I a(V \ {x}) = i](A \ {x}) ) . 

For U > Cfi the transition probabilities can be selected arbitrarily so long as 
the chain has the correct marginals. 

The contact process is a well known stochastic process which models the 
spread of infections (see e.g. |28|). Denoted here by <3/ : IR+ x A — > {0, 1}, 
a vertex in state 1 corresponds to an infection which heals over time but 
may also spread to neighboring vertices. More formally, vertices are set to 
(healed) according to i.i.d. rate h Poisson clocks. Infections spread along 
edges at according to rate A Poisson clocks so that if one end is 1 when 
the clock rings, the other end is set to 1. The following lemma bounds 
the disagreement process by a contact process and shows that under the 
following condition 

lim inf C M > ^ , (5.3) 

n->oo 2d + 1 

the probability of a disagreement converges to exponentially fast. 

Lemma 5.2. The disagreement process is stochastically dominated by a 
contact process W with healing rate and infection rate 1 — Then 

maxP(^(u) = 1) < exp (- (2d + 1) - ^—^j tj . (5.4) 

Proof. If a vertex u with 3£t(u) = 1 is updated then by construction with 
probability at least ^ the grand coupling selects the same spin in each the 
chains. That is, we set 3£t(u) = with probability exceeding the healing rate 
of the contact process. At the same time, a new disagreement can be created 
only when updating a vertex adjacent to a vertex with a disagreement. In 
this case the probability of a new disagreement is at most 1 — Cm an d hence 
this bounds the rate of the spread of infections. Taken together this implies 
that stochastically dominates %%. 

Define the number of infected sites as Wt = Z~2ueA^t( u )- Vertices are 
healed at a rate of WtCu while new infections occur at a rate of at most 
2d(l — (/j,)Wt which may be less when infected vertices are adjacent. Hence 

MWt < EW t (-C M + 2d(l - (,)) = -(2d + 1) U„ - ^j^) EW t , 



dt 
and so 



Eir,< |A|<vxp(-(2J+L)(C M -2^)/) ■ <^-» 
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which is only meaningful when equation (5.3) holds. If A is transitive, for 



example a torus, then it would immediately follow that by symmetry that 
nm(u) = 1) < exp (-(2d + 1) (Cu - t] . (5.6) 



2d + l. 

For general A C Z rf we can embed A into a large torus and note that the 



contact process is monotone in the edge set of the graph implying (5.6) 
for general graphs. The stochastic domination then implies equation (5.4), 
completing the lemma. ■ 

With this construction we are now able to complete the generalization to 
high temperature systems with soft interactions. It is easy to verify that on 
any graph with maximal degree A satisfies C, > A/(A + 1) for 

• ferromagnetic Potts model with < /3 < log(l + 40- 

• anti- ferromagnetic Potts model with > /3 > — j% log(l + A(g-i) )• 

• gas hard-core model with A < 1/A. 



A more direct analysis as in |24l Section 5.2] , [25| Theorem 3] and 33 44 



(noting that in the last case the coupling can be extended to a suitable grand 



coupling as required) establishes the exponential coupling condition (5.1) in 
a somewhat broader regime for the ferromagnetic Potts, anti-ferromagnetic 
Potts and hard-core models respectively. For instance, the requirement on 
the fugacity in the hard-core model becomes A < 2/ (A — 2). Combining 
these results with Theorem 5.1 establishes Theorem |3j Specifically for the 



square lattice Z d we can refine the requirement A < l/(d — l)by exploiting 

the bipartite nature of the geometry. Indeed, the standard reordering of the 

partial order on the lattice turns the system into monotone, at which point 

the analysis of §|4] becomes valid. Theorem |4] then follows from the work of 

I i (A— 1 

Weitz 451 which established strong spatial mixing whenever A < (a-2) a • 

Remark. The above analysis for the heat-bath dynamics can be easily ex- 
tended to the Metropolis-Hastings chain. The main alteration is to modify 
the definition of and set 

Ja(x) = s\a(A\{x}) = ri(A\{x}) 



mm 

xeA 

say 



?f£ M (a(x) = s> | a(A \ {x}) = rj(A \ {x})) 



Similarly to the heat-bath analysis, it is easy to verify that < < 1 and 
that one can construct a Markovian grand coupling for Metropolis where 
each update is independent of the present state with probability With 
this definition we therefore again obtain that inf n £' > 2d/(2d + 1) is a 
sufficient condition for cutoff on Z d . 
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5.2. Proper colorings. In the case of proper (/-colorings condition (5.3) 
never holds as we always have that = since having a neighbor with 
color s precludes a vertex from having color s. We instead use a modified 
grand coupling as defined in |25[ Section 5]. When a vertex u is selected 
for updating we generate a random permutation of the q colors and choose 
the first color which does not appear amongst its neighbors. This way, in 
the setting of the lattice Z d one of the first 2d + 1 colors must be chosen 



regardless of the local neighborhood. Now 25, Theorem 2] and its proof 



imply that with q > Ad(d+1) colors the probability of a disagreement in the 



grand coupling decays exponentially fast, thus establishing equation (5.1). 
The proof of Theorem [5] then follows from Theorem 5.1 



6. Cutoff for lattices with boundary conditions 

In this section we analyze the important case of the Glauber dynamics 
(Xt) on the cube Z^ with plus boundary conditions with a proof that natu- 
rally extends to the setting of free boundary conditions as well. Whilst boxes 
with periodic boundary conditions are transitive, imposing plus boundary 
conditions means that vertices in different parts of the graph must be treated 
differently. We capture this by characterizing vertices in terms of how many 
faces of the cube they are close to. 

As well as showing cutoff we will determine the cutoff location in terms 
of spectral gaps for infinite volume dynamics. Define Mj = Mjd to be the 
intersections of half-planes in Z d given by Z^_ x Z^ - -? where Z + = {x 6 
Z : x > 0}. We let X^ = Xj^' denote the spectral gap of the Glauber 
dynamics on Mj with all-plus boundary conditions and let fj,j denote its 
stationary distribution. With this definition we prove the following theorem 
to which Theorem [6] is a special case. 

Theorem 6.1. The Glauber dynamics (Xt) for the Ising model on Z^, the 
cube of side-length n with all-plus boundary conditions exhibits cutoff at 

- max (d- j)A^logn 

A 0<j<d ■" 

with a cutoff window of width O(loglogn). 

First we begin by defining blocks with a mixture of plus and periodic 
boundary conditions. Fix m = [log 3 n\ and for each < j < d let 
be the graph Z^ where we impose plus boundary conditions on the first 
j coordinates and periodic boundary conditions on the remaining d — j 
coordinates. In the case of d = 2 then T (0) is the torus (Z/mZ) 2 , T (1) 
corresponds to a cylinder with plus boundaries at then ends while 
corresponds to a box with all plus boundary conditions. 
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Let B = Bj denote the subset of T^> given by 



B, 



2m 
~3~ 



m 
IF' 



5m 



d-j 



'~6~J 



which we will refer to as the inner block. Note that the inner blocks Bj 
touch faces of the coordinates where there is a plus boundary condition. We 
define 

2 



m 



U) 



m 



(?) 



(m) : = 



max 

XO 



x 



[X^(B)e-)-^\ B 



5.1 



where denotes the Glauber dynamics on and its stationary 
distribution. 

For i G Z n we define the following intervals: 



J; 



and 





. . ,m} 






1 <i < f , 




< {f- 


— i + 


m 
2 


1} 


f + 1 < i < n- 


rn 
2 


{?! — 


m + 1, . . . 


,n} 






n, 


'a- 


2m\ 
••53/ 






1 < i < f , 




< {<- 


— Z + 


m 
6 


1} 


f + 1 < i < n- 


m 
2 


{n — 


2m i i 
3 5 ■ * 


. ,n} 




n-f +1 <i< 


?J. 



J; 



For each x = (xi, . ■ ■ , Xd) G let J x denote the block 

J X1 X . . . , X Jx d 5 

and define the inner block J x similarly. We classify vertices by the number of 
sides they are close to defining ip(x) = #{1 < i < d : max{i, d — i + l} < y}. 
Then for each x G there exists a graph isomorphism tp x from J x into 
jivi*)) such that J x is mapped to B C T { ^ x)) . It can easily be checked that 
for each < j < d and large n, 



-m- 



j n d ~ j < Mj < 2 d m j n d - j , 



where Mj = \{x : (p(x) = j}\ denotes the number of vertices of type j. 



3.2) 



By Theorem |2.3| the dynamics on all relevant rectangles has log-Sobolev 
constant at least a s . 

6.1. Upper bound on the L 1 -distance. With the framework of ^4] as 
our starting point, define the barrier dynamics as before except we replace 
balls in the graph distance with balls in the L°°-distance on the lattices as 
these are rectangles whose log-Sobolev constant can be bounded using The- 
orem 
lemma. 



2.3 With this minor modification of Lemma 4.7 we get the following 
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Lemma 6.2. Let Q s be the barrier- dynamics operator on a d- dimensional 
rectangle with side-lengths at most n and any combination of arbitrary and 
periodic boundary conditions. Let W s be its update sequence up to time s 
for s = 100d 2 2,T 2 log log ra and let S be the collection of sparse sets of G. 
For sufficiently large n we have P(Aw s G S) > 1 — n~ 10d and furthermore 
F(v G A W J < log~ 10d n for every v G V. 

Let A G S be a sparse subset of partitioned into L = L(A) components 
{Ai\f =l according to Definition 4.1. For each Ai let yi be a representative 
element chosen according to any arbitrary rule. We will classify the compo- 
nents according to the value of ip(yi) and define 

Lj = #{i : tf(yi) = j}. 

By the diameter bound on the components A4 we have that A{ C J Vi . 

For each i let 71 be a copy of TVPivw an d l e t G* denote the graph given by 
the union of these components, all disconnected from each other inheriting 
the boundary conditions of the %. By a slight abuse of notation will let 
■0i = denote the map taking J yi into the copy 71- By construction it also 
maps J Vi into the corresponding inner block B* . Finally, denote A* as the 
image of A. 

Let XI denote the Glauber dynamics on G* and fj,* its stationary dis- 
tribution. We couple Xt and XI as follows: Let Af = S^(r) where 
r = \ 2Qd 2 a~ l lognj as in equation (4.1). Note that by the sparseness prop- 
erties of the Ai the Af are disjoint. Now we couple the two dynamics so 
that if x is in some Af then we couple the updates of x and ipi(x). For all 
other vertices in and G* we can take the updates to be independent and 
we set initial conditions such that 

X (x) = X^i{x)) for all i and x G Af . (6.3) 

Repeating the proof of Lemma |4.9| with minor modifications we have that 

F X0 (X t (A)e-)-(j,\ A \\ TV -\\F x *(Xf(A*)e-)-^^ < n" M / 3 (6.4) 



and hence applying Lemma 4.6 for t < 2da s 1 logn we have that, 

\\F X0 (X t+s G •) — HU < E \\F x *(X;(U t B*) G •) — //| A || Tv +2n- M / 3 . (6.5) 
As X t * is a product chain we can apply Proposition [7] giving us that 



max||P x5 (X*(Ui£*) G •) -//|a|| tv < ^=0^^ • (6-6) 
The randomness here is in the expected values of Lj which we bound using 



Lemma 6.2 



ELj < ^ P(v G A Ws ) < Nj \og- wd n . (6.7) 

v£lh:<p(v)=j 
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Combining equations (6.2), (6.5), (6.6) and (6.7) using Jensen's inequality 
we have that 



XQ 



6.2. Lower bound on the L 1 -distance. For a lower bound on the total 
variation we embed small blocks into Z^ of type roughly proportional to A/}. 
Taking the vertices 



(lft)i=l,...,£ :={(xi,...,x d )eZ*: ( Xj - 1) /(2m) G Z, 1 < j < d} C Z 



d 

n > 



the blocks J yi are disjoint and Lj = \{i : ip{yi) = j}\, the number of vertices 
of type j, satisfies 

Lj > 3- d m~ d n d - j , 

for large n. 

For each i let 71 be a copy of TvciVi)) an d let G* denote the graph given 
by the union of these components, all disconnected from each other. Let 
ipi = iftyt denote the map taking J yi into Ti- By construction it maps J yi 
into the corresponding inner block B{. Denote A = DiJ yi and A* = Uj-Bj. 

Let XI denote the Glauber dynamics on G* , inheriting the boundary 
conditions of the blocks 71, and let fi* denote its stationary distribution. As 
in the upper bound we couple the dynamics on G and G* so that if x is in 
some J Vi then we couple the updates of x and ipi(x) and for all other vertices 
in lj d we take the updates to be independent. We will assume our initial 
conditions satisfy Xq(x) = X^iipiix)) for all i and x G J Vi . 



As in the case of the upper bound, with minor modifications of Lemma 4.9 



and Lemma 4.6 we have that for t < 2a s 1 dlogn, 



max \\r x 

XQ 



(X t G •) " MlU > m F \\F X * (X,*(A*) G •) - ^. || TV - 2n" 2d / 3 . 



It is sufficient to consider this range of t since a minor modification of the 
proof of equation (|4.7|) implies that for all 5 > and large enough n(S) we 



have that t M ix(<5) < 2S S 1 dlogn. Now, similarly to equation (4.11), when t 
is of order log n then 



XQ 



(^ t *(Bi)€-)-M*|B i || L oo (wi|fl . ) =o(l). 
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Hence applying Proposition [7] we have that 
max\\F Xo (X t G •) - /i|L 

> ^^maxHP^^;^) G 0-M*klli 2 (^) 



o(l) 



> 



(6.9) 



•j=0 



where = 2$(5^/x) - 1. 



6.3. Existence of Cutoff. We are now able to prove establish the existence 
of cutoff for Xf. Define the approximate mixing time as 



t* = inf i 



t>0 : Y, 3 ' dm t 

3=0 



-d m (j) n d-j log -3d n < logn 

Let Xj(m) denote the second eigenvalue of the dynamics on %. Then a^ 1 < 
\j(m) < 1 and so by Theorem 2.1 we have that 



m 



U) 



max 

XO 



< exp(— Xj(m)t + 0(a s 1 log log n)) , 



(6.10) 



and so t* < (d + o(l))S s 1 logn. Hence we can apply equation (6.8) to show 
that 



maxHP^pr, 



■t'o 



t*+i 



e ■ 



Ml 



< \/d6 d log l - 4d n + 2n 



-2d/3 



o(l). 



while by equation (6.9) we have that 

M|| TV >*(logn)-o(l) = l-o(l), 



max | 

x 



XO 



where ^>(x) = 2<£(|y / x) — 1. It follows that t Mai (e) is between t* and t* + s 
for any < e < 1. Recalling that results of [20] imply that the L 1 mixing 
time of the Glauber dynamics for the Ising model on (Z/nZ) d has order at 
least logn, we have that t* is also of order at least logn and hence Xt has 
cutoff at t* with a window of width at most s = 100d 2 2~ 2 log logn. 



6.4. Cutoff Location. The rest of the section concerns relating the quan- 
tities to eigenvalues of infinite volume Glauber dynamics to estimate t* 
and the cutoff location. 
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Lemma 6.3. There exist Xj such that Xj(m) — > Xj as m 
there exists c = c(/3,j, d) > such that for m = log 3 n, 



oo . Moreover, 



| Xj (m) — Xj | < 



c log log n 
log n 



and for < t < 2da s 1 log n and n sufficiently large, 



s — Aj t — c log log n 



32n" 4d/3 <m ( t j \m) <e 



Ajt+c log log n 



(6.11) 



Proof. In the proof of the upper bound in Section |6.1| we could apply the 
same analysis to the Markov chain X^'- 1 on 7~ti) but crucially mapping com- 
ponents of the update support into blocks T% of side-length log 3 n (rather 
than log 3 m). By construction, all components of the update support will 
be of the form T (j) . With s = 100d 2 a" 2 log log n, an adaptation of equa- 



tion (6.8) gives us that 



max 



*,3 



< \J 2 d m { t j) log~ 7d n + 2n 



id 



-2d/3 



(6.12) 



when t < 2da~ logn. We can also vary the size of the blocks that the 
components of the update support are mapped into for instance taking them 
of side- length m! G [|m, |m] in which case we get the bound 



max 



X i (Xy s G •) — < J2 d m ( t j \m') log-™ n + 2n" M / 3 . (6.13) 



Let $l* m denote the state space of X^' J . Since loglog(l//Lt* J (cr)) > (3d + 
o(l)) log logn for all a € VL* m and since Xj(m) < 1 (vertices are updated at 



rate 1), Theorem 2.1 implies that for large n 



m ( t j \m') < exp ( 1 - Xj{m) ( t - ^- log log (1/ u* J (a)) 



< e 



-A^(m')t+^gfii log logn 



(6.14) 



Now a standard lower bound on the total variation distance in terms of 



the spectral gap (cf. its discrete-time analogue 27, equation (12.13)]) gives 
that for all t > 0, 



-A(m)i ^ 2 



max 



x, 



¥(X* J <E 



(6.15) 



(i 
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Combining equations (6.14), (6.13) and (6.15) we have that for any < 

3 

5Aj (m') 



t < 5A J m ,-, logn, s = 100d 2 a s 2 log log n and m! € [fm, fm], 

P(4 e 



-Aj(m)(t+s) ^ 2 



max 
1 " 



/i 



< 2^ 2 d m\ j) (m') log~ 7d n + 4n~ M/3 

< e -A J (m / )t+ 3d 4 t° s (1) log logn 



(6.16) 

Since Xj(m') < 1 for all m! we may take t = \ logn = ^m 1 ^ 3 and get that, 



^m 1/3 + 100d 2 a" 2 log 



in 



A(m) > 



-m 



1/3 



d + o(l) 1 
Aj (m j — log m . 



4a s 



and hence with c\ 



2rf+o(l) 
4o s 



+ 200d 2 S 



2^-2 



Xj(m) - \j{w!) > -(ci + o(l))^|^ . 



Rearranging the role of m and m! we have that for all m' £ [|m, |m] then 

A iM - A i( m') < ( Cl + o(l))^ < (2c a + (1))^ . 
Combining the previous two equations we have that for all m! G [§m, |m] 



|A,(m)-A i (m')|<(2c 1 + o(l))^ 



(6.17) 



This implies that Xj{m) converges to some limit Xj as m — > oo and that 



Mm) - Xj\ < lAiCd)*"^) - A,((|) fc m)| 



fc=l 



<yj(2 C1+ „ (1 )) log «i> im » 



< 



(§)*/s TO i/a 
35ci log m 105ci log log n 



m 



1/3 



log n 



(6.18) 



for large m. 

Substituting (6.16) into (6.18) and using the fact that x < y + z implies 
2y 2 > x 2 — 2z 2 we have that for a large enough constant c and large n, 

e -2Xjt-c log logn _ 32 n _4d / 3 < tn^(m') < e ~ 2 ^*+ clo s lo g n 
which completes the lemma. ■ 



Lemma 6.3 implies that 
t* 



min (d — j)X, 1 ) logn 
o<i<d-i '3 J 



< c log log n . 



12 



EYAL LUBETZKY AND ALLAN SLY 



The last step in the proof of Theorem 



6.1 



will be to show that Aj = Xj 



(00) 



Let Y} denote the Glauber dynamic on 



(t(v)-Ect(v) 



Consider the function 



and let 



By the characterization of the spectral gap as the slowest exponential rate 
of decay to of K yo f(Yt) for mean zero L 2 functions under the action of the 
semi-group we have that 

exp(-Af°) ^limsup^P'y/'. 

As before let X^' 3 denote the Glauber dynamics on the block T^' of side- 
length m = log 3 n. Let 

B m = {{ Xl , . . . ,x d ) £ Z d : < x { < m/2} , 

which we will associate with subsets both of 7^) = and Mj in the nat- 
ural wa y. B y another disagreement percolation argument of the type in 



Lemma 



4.4 



we can couple and X*^ starting from the all plus configu- 
ration so that 



-(YlohBm) + X^ n (B m )) < exp(-log 2 n) 



(6.19) 



and similarly for all minus initial configurations. By the standard monotone 
coupling and symmetry of T^> 



max 

x 



< 2 d E [E + X*' J -E^X* d 
By the coupling we have that of Y^ and X*' 3 it follows that, 



,d+l c U) _|_ m 3 e -log 2 n_ 



(6.20) 



log n 



(6.21) 



— Mog 71 

The standard lower bound on the total variation distance in terms of the 
spectral gap (cf. its discrete-time analogue |27l equation (12.13)]) gives that, 



-\j (m) log n ^ 2 



max 



log n 



(6.22) 



and so combining equations (6.20), (6.21) and 6.22 we have that 

1 / log n 



e~ A ^ < limsup (2m) d+i £ 



"log n 



+ m e 



- log n 



. (00) 

< e" A i 
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and hence Xj > X^°°\ 

It remains to prove that Xj < A^- . Now fix e > and recall the Dirichlet 



form (2.4), according to which 



,(oo) . , <>, (/> /) 

K = mf ^ — tt\ j 

3 feL2({±i} B 3, H ) Var Aioo (/) 

where fij is the stationary measure of the infinite- volume Ising model. For 
any / G L 2 ({±1} H J ', with <^ -(/, /) < oo we can find a sequence of 
functions f n G L 2 ({±1} H -? , /ij) each of which depends only on a finite number 
of spins such that f n ->■ / in L 2 ({±1} H ^ , ^) and ^ (/„, / n ) -> ^ . (/, /) (see 



e.g. the proof of 28, Lemma 4.3]). So take g G L ({±1} ^ , fj,j) depending 



only on a finite number of spins such that 

Var^fo) " Var M ,(/) £ ' 

For some large enough M we have that g is a function of the spins in the 
box Am = {x G Hj : ||a;||oo < M}. We compare the Ising model on Mj and 
on with side-length m, identifying the vertices Am C Mj with those in 
A* M = {x £ T (i) : x mod m G Ajvf}. By the strong spatial mixing property, 



\Vj\A M ~ fJ>m\A* u \\L° 



as m — > oo and hence 



~P — — = lim — > hm Xj(m) = A,- 

Var^ (5) m^oo Var^j (g) 



j ' 



where the inequality follows from the characterization of the spectral gap 
by the Dirichlet form. This implies that 

. (do) • t &Hi\fi /) . c 
' / Var Mj .(/) 



and so A^ 00 "* > Aj which completes the proof of Theorem 



6.1 
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